AITopics | integral representation

An approach to construct explicit integral representations for two-layer ReLU networks is presented, which provides relatively simple representations for any multivariate polynomial. Quantitative bounds are provided for a particular, sharpened ReLU integral representation, which involves a harmonic extension and a projection. The bounds demonstrate that functions can be approximated with $L^{2}(\mathcal{D})$ errors that do not depend explicitly on dimension or degree, but rather the coefficients of their monomial expansions and the distribution $\mathcal{D}$. We also present a connection to the RKHS of the exponential kernel $K(x,y)=\exp\left(\left\langle x,y\right\rangle \right)$, and a very simple integral representation involving additionally multiplication via a fixed function which has better quantitative bounds.

artificial intelligence, machine learning, representation, (18 more...)

arXiv.org Machine Learning

2604.2326

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

fcc3dc27672a12510babe448d665e152-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 02:25:55 GMT

neural network, representation, ridgelet transform, (12 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Generalization Properties of Learning with Random Features

Alessandro Rudi, Lorenzo Rosasco

Neural Information Processing SystemsNov-21-2025, 09:53:03 GMT

We study the generalization properties of ridge regression with random features in the statistical learning framework.

artificial intelligence, machine learning, random feature, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Universality of Group Convolutional Neural Networks Based on Ridgelet Analysis on Groups

Neural Information Processing SystemsAug-19-2025, 21:58:10 GMT

We show the universality of depth-2 group convolutional neural networks (GC-NNs) in a unified and constructive manner based on the ridgelet theory.

artificial intelligence, machine learning, ridgelet transform, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Universality of Group Convolutional Neural Networks Based on Ridgelet Analysis on Groups

Neural Information Processing SystemsAug-19-2025, 21:58:06 GMT

We show the universality of depth-2 group convolutional neural networks (GC-NNs) in a unified and constructive manner based on the ridgelet theory.

artificial intelligence, machine learning, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

b4f8e5c5fb53f5ba81072451531d5460-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 00:35:32 GMT

artificial intelligence, descent, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Estimating properties of a homogeneous bounded soil using machine learning models

Kalimeris, Konstantinos, Mindrinos, Leonidas, Pallikarakis, Nikolaos

arXiv.org Artificial IntelligenceJun-6-2025

This work focuses on estimating soil properties from water moisture measurements. We consider simulated data generated by solving the initial-boundary value problem governing vertical infiltration in a homogeneous, bounded soil profile, with the usage of the Fokas method. To address the parameter identification problem, which is formulated as a two-output regression task, we explore various machine learning models. The performance of each model is assessed under different data conditions: full, noisy, and limited. Overall, the prediction of diffusivity $D$ tends to be more accurate than that of hydraulic conductivity $K.$ Among the models considered, Support Vector Machines (SVMs) and Neural Networks (NNs) demonstrate the highest robustness, achieving near-perfect accuracy and minimal errors.

artificial intelligence, machine learning, soil property, (17 more...)

arXiv.org Artificial Intelligence

2506.04256

Country: Europe > Greece (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.87)

Add feedback

Curse of Dimensionality in Neural Network Optimization

Na, Sanghoon, Yang, Haizhao

arXiv.org Machine LearningFeb-7-2025

The curse of dimensionality in neural network optimization under the mean-field regime is studied. It is demonstrated that when a shallow neural network with a Lipschitz continuous activation function is trained using either empirical or population risk to approximate a target function that is $r$ times continuously differentiable on $[0,1]^d$, the population risk may not decay at a rate faster than $t^{-\frac{4r}{d-2r}}$, where $t$ is an analog of the total number of optimization iterations. This result highlights the presence of the curse of dimensionality in the optimization computation required to achieve a desired accuracy. Instead of analyzing parameter evolution directly, the training dynamics are examined through the evolution of the parameter distribution under the 2-Wasserstein gradient flow. Furthermore, it is established that the curse of dimensionality persists when a locally Lipschitz continuous activation function is employed, where the Lipschitz constant in $[-x,x]$ is bounded by $O(x^\delta)$ for any $x \in \mathbb{R}$. In this scenario, the population risk is shown to decay at a rate no faster than $t^{-\frac{(4+2\delta)r}{d-2r}}$. To the best of our knowledge, this work is the first to analyze the impact of function smoothness on the curse of dimensionality in neural network optimization theory.

artificial intelligence, machine learning, neural network, (19 more...)

arXiv.org Machine Learning

2502.0536

Country:

North America > United States > Maryland > Prince George's County > College Park (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Industry: Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Generalization Properties of Learning with Random Features

Alessandro Rudi, Lorenzo Rosasco

Neural Information Processing SystemsOct-3-2024, 12:13:58 GMT

We study the generalization properties of ridge regression with random features in the statistical learning framework. We show for the first time that O(1/ n) learning bounds can be achieved with only O( n log n) random features rather than O(n) as suggested by previous results. Further, we prove faster learning rates and show that they might require more random features, unless they are sampled according to a possibly problem dependent distribution. Our results shed light on the statistical computational trade-offs in large scale kernelized learning, showing the potential effectiveness of random features in reducing the computational complexity while keeping optimal generalization properties.

kernel, random feature, ridge regression, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report > New Finding (0.89)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Aspects of importance sampling in parameter selection for neural networks using ridgelet transform

Homma, Hikaru, Ohkubo, Jun

arXiv.org Artificial IntelligenceJul-26-2024

The choice of parameters in neural networks is crucial in the performance, and an oracle distribution derived from the ridgelet transform enables us to obtain suitable initial parameters. In other words, the distribution of parameters is connected to the integral representation of target functions. The oracle distribution allows us to avoid the conventional backpropagation learning process; only a linear regression is enough to construct the neural network in simple cases. This study provides a new look at the oracle distributions and ridgelet transforms, i.e., an aspect of importance sampling. In addition, we propose extensions of the parameter sampling methods. We demonstrate the aspect of importance sampling and the proposed sampling algorithms via one-dimensional and high-dimensional examples; the results imply that the magnitude of weight parameters could be more crucial than the intercept parameters.

neural network, regression, ridgelet transform, (17 more...)

arXiv.org Artificial Intelligence

2407.18655

Country:

Asia > Japan > Honshū > Kantō > Saitama Prefecture > Saitama (0.04)
North America > United States > New York (0.04)

Genre: Research Report > Experimental Study (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback

Filters

Collaborating Authors

integral representation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Explicit integral representations and quantitative bounds for two-layer ReLU networks

fcc3dc27672a12510babe448d665e152-Supplemental-Conference.pdf

Generalization Properties of Learning with Random Features

Universality of Group Convolutional Neural Networks Based on Ridgelet Analysis on Groups

Universality of Group Convolutional Neural Networks Based on Ridgelet Analysis on Groups

b4f8e5c5fb53f5ba81072451531d5460-Paper.pdf

Estimating properties of a homogeneous bounded soil using machine learning models

Curse of Dimensionality in Neural Network Optimization

Generalization Properties of Learning with Random Features

Aspects of importance sampling in parameter selection for neural networks using ridgelet transform